Making Useful Conflict Predictions: Methods for Addressing Skewed Classes and Implementing Cost-Sensitive Learning in the Study of State Failure
نویسندگان
چکیده
One of the major issues in predicting state failure is the relatively rare occurrence of event onset. This class skew problem can cause difficulties in both estimating a model and selecting a decision boundary. Since the publication of King and Zeng’s (2001) study, scholars have utilized case-control methods to address this issue. This paper re-analyzes the landmark research of the Political Instability Task Force (Goldstone et al. 2010), comparing the casecontrol approach to several other methods from the machine learning field and some original to this study. Case-control methods are outperformed by almost all of the alternatives. A multilevel model on the raw data performs best. The article also introduces cost sensitive methods for determining a decision boundary. This explication reveals problems in the Task Force’s formulation of a decision boundary and suggests methods for making useful predictions for policy.
منابع مشابه
An Investigation of the Relationship between L2 Learning Styles and Teaching Methodologies in EFL Classes
Individual differences have always been a key element in the success and failure of learners in language classrooms. Learners come to EFL classes with various learning styles and teachers utilize different methodologies targeting different needs of the learners which may have important effects on the quality of the learning environment. In this study a comparison is made between learning styles...
متن کاملDeveloping and Implementing Log Book in Teaching Principles and Techniques to Nursing and Midwifery Students: Mixed Method Study
Background: There is an interval between clinical and theoretical teachings in nursing which proper teachings during initial courses in nursing. Therefore, the purpose of this process was to comply Log Book in teaching principles and techniques to nursing students. Methods: This mixed study was an exploratory study which was done in three stages on midwifery and nursing students. At first, Log ...
متن کاملA New Formulation for Cost-Sensitive Two Group Support Vector Machine with Multiple Error Rate
Support vector machine (SVM) is a popular classification technique which classifies data using a max-margin separator hyperplane. The normal vector and bias of the mentioned hyperplane is determined by solving a quadratic model implies that SVM training confronts by an optimization problem. Among of the extensions of SVM, cost-sensitive scheme refers to a model with multiple costs which conside...
متن کاملData Mining Performance in Identifying the Risk Factors of Early Arteriovenous Fistula Failure in Hemodialysis Patients
Background and Objectives: Arteriovenous fistula is a popular vascular access method for surgical treatment of hemodialysis patients. The method, however, is associated with a high rate of early failure varying in the range of 20-60%. Predicting early Arteriovenous fistula failure and its risk factors can help reduce its incidence, its hospitalization rate, and associated costs. In this study, ...
متن کاملCredit Card Fraud Detection using Data mining and Statistical Methods
Due to today’s advancement in technology and businesses, fraud detection has become a critical component of financial transactions. Considering vast amounts of data in large datasets, it becomes more difficult to detect fraud transactions manually. In this research, we propose a combined method using both data mining and statistical tasks, utilizing feature selection, resampling and cost-...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014